A Proposal for Lexical Disambiguation
Authors
Abstract
A method of sense resolution is proposed that is based on WordNet, an on-line lexical database that incorporates semantic relations (synonymy, antonymy, hyponymy, meronymy, causal and troponymic entailment) as labeled pointers between word senses. With WordNet, it is easy to retrieve sets of semantically related words, a facility that will be used for sense resolution during text processing, as follows. When a word with multiple senses is encountered, one of two procedures will be followed. Either, (1) words related in meaning to the alternative senses of the polysemous word will be retrieved; new strings will be derived by substituting these related words into the context of the polysemous word; a large textual corpus will then be searched for these derived strings; and that sense will be chosen that corresponds to the derived string that is found most often in the corpus. Or, (2) the context of the polysemous word will be used as a key to search a large corpus; all words found to occur in that context will be noted; WordNet will then be used to estimate the semantic distance from those words to the alternative senses of the polysemous word; and that sense will be chosen that is closest in meaning to other words occurring in the same context. If successful, this procedure could have practical applications to problems of information retrieval, mechanical translation, intelligent tutoring systems, and elsewhere.
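The two procedures described in the abstract can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the RELATED table and the PARENT taxonomy are hypothetical stand-ins for WordNet, CORPUS is a four-sentence toy corpus, path length in the toy taxonomy is one assumed way of estimating semantic distance, and every function name is invented for this example. The context is assumed to contain the polysemous word exactly once.

```python
from collections import Counter
import math

# Hypothetical stand-in for WordNet: words related to each sense of "bank".
RELATED = {"bank": {"bank/finance": ["lender", "institution"],
                    "bank/river": ["shore", "riverside"]}}

# Hypothetical toy taxonomy (child -> parent), used only to measure distance.
PARENT = {"bank/finance": "institution", "lender": "institution",
          "institution": "entity", "bank/river": "landform",
          "shore": "landform", "riverside": "landform", "landform": "entity"}

# Toy corpus, tokenised on whitespace.
CORPUS = [s.split() for s in (
    "the lender approved the loan",
    "the institution approved the loan",
    "the lender approved the mortgage",
    "the shore was muddy after the flood",
)]

def ngram_count(ngram, sentence):
    """Count occurrences of a token sequence inside one tokenised sentence."""
    n = len(ngram)
    return sum(sentence[i:i + n] == ngram for i in range(len(sentence) - n + 1))

def resolve_by_substitution(context, target):
    """Procedure (1): substitute related words, count derived strings."""
    scores = Counter()
    for sense, words in RELATED[target].items():
        for word in words:
            derived = [word if tok == target else tok for tok in context]
            scores[sense] += sum(ngram_count(derived, s) for s in CORPUS)
    return max(scores, key=scores.get), scores

def slot_fillers(context, target):
    """Collect corpus words that occur in the slot held by the target."""
    fillers, n, slot = Counter(), len(context), context.index(target)
    for sentence in CORPUS:
        for i in range(len(sentence) - n + 1):
            window = sentence[i:i + n]
            if all(c == target or c == w for c, w in zip(context, window)):
                if window[slot] != target:  # the ambiguous word tells us nothing
                    fillers[window[slot]] += 1
    return fillers

def path_distance(a, b):
    """Edges between two nodes via their nearest shared ancestor."""
    def chain(node):
        out = [node]
        while node in PARENT:
            node = PARENT[node]
            out.append(node)
        return out
    up_a, up_b = chain(a), chain(b)
    for i, node in enumerate(up_a):
        if node in up_b:
            return i + up_b.index(node)
    return math.inf

def resolve_by_distance(context, target):
    """Procedure (2): pick the sense closest on average to the slot fillers."""
    fillers = slot_fillers(context, target)
    scores = {}
    for sense in RELATED[target]:
        known = [(path_distance(w, sense), n) for w, n in fillers.items()]
        known = [(d, n) for d, n in known if d != math.inf]
        total = sum(n for _, n in known)
        scores[sense] = sum(d * n for d, n in known) / total if total else math.inf
    return min(scores, key=scores.get), scores

context = "the bank approved the".split()
sense_1, counts = resolve_by_substitution(context, "bank")
sense_2, distances = resolve_by_distance(context, "bank")
```

On this toy data both procedures select the financial sense. With a real corpus, procedure (1) depends on the derived strings actually being attested, while procedure (2) depends on how well the chosen distance measure reflects closeness in meaning.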
Similar resources
Qualia Based Lexical Knowledge for the Disambiguation of the Japanese Postposition 'No' (short paper)
This paper proposes the elaboration of the qualia structure of the Generative Lexicon in [5] and the Extended Generative Lexicon theory [3]. My proposal is based on the Japanese genitive postposition no. The Japanese “NP1-no NP2” construction expresses a wider range of relations between two entities than does the English possessive “NP1’s NP2,” such that the Pustejovskian qualia roles encoded i...
A Proposal for Word Sense Disambiguation using Conceptual Distance
This paper presents a method for the resolution of lexical ambiguity and its automatic evaluation over the Brown Corpus. The method relies on the use of the wide-coverage noun taxonomy of WordNet and the notion of conceptual distance among concepts, captured by a Conceptual Density formula developed for this purpose. This fully automatic method requires no hand coding of lexical entries, hand t...
Chinese Base-Phrases Chunking
This paper introduces new definitions of Chinese base phrases and presents a hybrid model to combine Memory-Based Learning method and disambiguation proposal based on lexical information and grammar rules populated from a large corpus for 9 types of Chinese base phrases chunking. Our experiment achieves an accuracy (F-measure) of 93.4%. The significance of the research lies in the fact that it ...
A Hybrid Model for Word Sense Disambiguation in English-Portuguese Machine Translation
We present the proposal for an approach to word sense disambiguation with application in machine translation from English to Brazilian Portuguese. This approach follows a hybrid natural language processing method, that is, a mixture of knowledge and corpus-based approaches. The main innovative feature is the formalism that we intend to use to represent the instances and the background knowledge...
Race-Based Parsing and Syntactic Disambiguation
We present a processing model that integrates some important psychological claims about the human sentence parsing mechanism, namely that processing is influenced by limitations on working memory and by various syntactic preferences. The model uses time-constraint information to resolve conflicting preferences in a psychologically plausible way. The starting point for this proposal is the Sausage...
Production of English Lexical Stress by Persian EFL Learners
This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...
Journal title:
Volume, issue:
Pages: -
Publication date: 1991